Inner Product Computation In-Memory Using Distributed Arithmetic

Authors

Abstract

In-memory computing using emerging technologies such as Resistive Random-Access Memory (ReRAM) has been proposed as a promising substitute for future applications to address the ‘von Neumann bottleneck’. Multiplication is a key component of inner product computation in every digital signal processing (DSP) application, and the complexity of multipliers increases greatly with bit-width. Distributed arithmetic (DA) uses look-up tables and an adder-shifter module to achieve multiplier-less and efficient DSP architectures, particularly when one of the vectors is constant and known in advance. Due to the memory wall, DA can be made even more latency- and energy-efficient if it is implemented ‘in memory’. In this work, for the first time, we propose two design techniques to compute completely in memory using DA. This is accomplished by storing the precomputed table contents in the ReRAM array and implementing the adder-shifter also in the same array. The adder is implemented using majority gates, which are in turn realized as READ operations in the ReRAM array. Two methods of mapping, latency-optimized and area-optimized, and their comparison in terms of area are presented. Method-1 achieves $\approx 60$% energy savings compared to CMOS, and method-2 achieves 10.59 times higher throughput than CMOS.
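The DA idea summarized in the abstract (a look-up table of precomputed partial sums plus an adder-shifter, with no multiplier) can be illustrated in software. The following is a minimal sketch, not the paper's ReRAM design: it assumes unsigned inputs of a fixed bit-width, and the names `build_lut`, `da_inner_product`, `coeffs`, and `xs` are hypothetical, chosen only for this illustration.

```python
def build_lut(coeffs):
    """Precompute all 2**K partial sums of the constant vector (K = len(coeffs))."""
    k = len(coeffs)
    lut = []
    for address in range(2 ** k):
        # Bit i of the address decides whether coeffs[i] is part of this sum.
        lut.append(sum(c for i, c in enumerate(coeffs) if (address >> i) & 1))
    return lut


def da_inner_product(lut, xs, bit_width):
    """Inner product of the LUT's constant vector with unsigned inputs xs.

    Assumption for this sketch: every x in xs satisfies 0 <= x < 2**bit_width.
    """
    acc = 0
    for b in range(bit_width):
        # Gather bit b of every input into a single LUT address.
        address = 0
        for i, x in enumerate(xs):
            address |= ((x >> b) & 1) << i
        # Adder-shifter step: weight the looked-up partial sum by 2**b.
        acc += lut[address] << b
    return acc


coeffs = [3, -5, 7, 2]   # constant vector, known in advance
xs = [9, 4, 15, 1]       # variable vector, 4-bit unsigned values
lut = build_lut(coeffs)
result = da_inner_product(lut, xs, bit_width=4)
print(result)  # 114, the same as sum(c * x for c, x in zip(coeffs, xs))
```

The loop performs one table read and one shift-add per input bit, so the cost grows with the bit-width rather than with the complexity of a multiplier. The table itself has 2**K entries, which is why DA suits short constant vectors.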

Similar resources

Inner product computation for sparse iterative solvers on distributed supercomputer

Recent years have shown that iterative Krylov methods, unless re-designed, are not suitable for distributed supercomputers because of intensive global communications. It is well accepted that re-engineering Krylov methods for a prescribed computer architecture is necessary and important to achieve higher performance and scalability. The paper focuses on simple and practical ways to re-organize...

Arithmetic inner product formula for unitary groups

High-performance distributed arithmetic

This study examines methods for implementing finite impulse response (FIR) filters. Two kinds of unit are used to implement an FIR filter: multiply-accumulate (MAC) and distributed arithmetic (DA). Implementing an FIR filter with DA improves the occupied area by 50 to 80 percent and also reduces power consumption. Methods for improving the basic DA structure are also presented, which increase the performance of the filter...

Fast Inner Product Computation on Short Buses

This article is brought to you for free and open access by Computer Science at ODU Digital Commons. It has been accepted for inclusion in Computer Science Faculty Publications by an authorized administrator of ODU Digital Commons. Repository Citation: Lin, R. and Olariu, S., "Fast Inner Product Computation on Short Buses" (2002). C...

Frames in 2-inner Product Spaces

In this paper, we introduce the notion of a frame in a 2-inner product space and give some characterizations. These frames can be regarded as usual frames in a Hilbert space, so they share many useful properties with frames.

Journal

Journal title: IEEE Transactions on Circuits and Systems I: Regular Papers

Year: 2022

ISSN: 1549-8328, 1558-0806

DOI: https://doi.org/10.1109/tcsi.2022.3193678